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I.  INTRODUCTION 


Over  the  past  several  years,  extensive  experimentation  has  been 
conducted  on  the  ONR  motion  generator  in  an  effort  to  characterize  the 
effects  of  various  motion  parameters  on  motion  sickness  incidence. 

Early  research  focused  on  single-frequency  sinusoidal  heave  motions,  but 
subsequent  studies  have  also  addressed  the  problem  of  more  complex  motion 
conditions.  The  ultimate  goal  is  to  develop  a  model  capable  of  pre¬ 
dicting  motion  sickness  incidence  in  complex  motion  environments  char¬ 
acteristic  of  ships  at  sea. 

In  this  investigation,  we  consider  12  different  data  sets  taken  from 
various  motion  studies.  Each  of  these  experiments  limited  exposure  time 
to  two  hours  duration  and  each  used  the  occurrence  of  frank  emesis  as 
the  diagnostic  motion  sickness  criterion.  The  use  of  frank  emesis  as 
an  experimental  endpoint,  however,  has  frequently  been  criticized  (see, 
for  example,  [8]).  Accordingly,  in  a  recent  set  of  experiments  [6], 
an  attempt  was  made  to  identify  measurable  correlates  of  motion  sick¬ 
ness.  Part  of  that  investigation  involved  characterizing  the  distri¬ 
bution  of  time  to  first  emesis  data.  In  section  II  we  present  further 
evidence  indicating  that  the  susceptible  portion  of  the  subject  pop¬ 
ulation  follows  a  Weibull  probability  model. 

The  study  mentioned  above  made  use  of  dual-frequency  waveforms  as 
a  first  step  toward  understanding  the  effects  of  broadband  motions.  A 
primary  question  was  whether  there  were  any  significant  differences  in 
severity  between  the  conditions  of  that  study.  In  section  III  we  examine 
this  question,  using  a  nonparametric  test  which  can  detect  very  general 
types  of  differences  between  the  motion  conditions. 


II.  DISTRIBUTION  OF  TIME  TO  FIRST  EMESIS 


Ideally,  any  model  of  time  to  emesis  should  be  based  primarily 
on  theoretical  considerations.  However,  lack  of  knowledge  about  the 
physical  mechanisms  responsible  for  motion  sickness,  coupled  with  large 
differences  in  individual  susceptibility,  has  forced  researchers  to  rely 
on  a  mostly  empirical  approach  to  the  modeling  problem.  Because  of 
their  known  applicability  in  a  wide  variety  of  situations  involving 
time  until  failure,  the  lognormal  and  Weibull  models  have  generally  been 
suggested.  In  this  section  we  add  further  evidence  in  support  of  the 
use  of  the  Weibull  distribution  as  a  model  for  the  portion  of  the  pop¬ 
ulation  susceptible  to  motion  sickness. 

Most  of  the  existing  experimental  data  on  motion  sickness  have  been 
obtained  in  experiments  which  limited  exposure  to  two  hours  duration. 
Because  of  this  truncation,  Mauro  and  Smith  [4]  considered  censored  log¬ 
normal  and  censored  Weibull  distributions  as  possible  models  for  time  to 
emesis.  They  found,  however,  that  neither  of  these  models  adequately 
fits  the  observed  data.  Noting  that  the  cumulative  motion  sickness  in¬ 
cidence  tends  to  stabilize  after  about  90  minutes,  they  then  postulated 
a  statistical  mixture  model.  This  model  treated  the  uncensored  portions 
(i.e.,  emesis  within  2  hours)  of  the  populations  under  study  as  complete 
samples  from  one  component  of  the  mixture  distribution.  The  Weibull 
distribution  was  found  to  fit  the  susceptible  portions  of  the  populations 
well,  while  the  lognormal  model  was  not  consistent  with  the  data. 

Thomas,  Guignard,  and  Willems  [6,7]  have  advocated  a  mixture  of 
three  distributions  to  model  time  to  first  emesis,  treating  the  uncen¬ 
sored  portions  of  the  samples  as  mixtures  of  two  Weibull  distributions. 
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corresponding  to  highly  susceptible  and  less  susceptible  portions  of 
the  populations.  While  not  inconsistent  with  the  data,  this  hypothesis 
seems  to  be  untenable  in  light  of  the  findings  of  Mauro  and  Smith. 

Since  it  has  not  been  contradicted  by  the  data,  the  more  parsimonious 
model  is  preferable. 

The  data  used  in  this  investigation  were  taken  from  a  number  of 
sources.  Seven  data  sets  from  various  single- frequency  studies  [5] 
have  been  used,  along  with  one  single-frequency  and  four  dual- frequency 
data  sets  from  the  "Correlation  Study"  [6],  A  complete  listing  of  the 
data  is  given  in  the  appendix.  These  are  the  same  data  sets  analyzed 
previously  by  Mauro  and  Smith.  However,  in  that  study,  "quitters"  were 
not  included  in  the  analysis  of  the  Correlation  Study  data.  (Quitters 
were  included  in  the  single  frequency  data.)  There  is  substantial  evi¬ 
dence  (see,  for  example,  [3])  indicating  that  those  who  quit  before  the 
completion  of  the  experiment  (and  prior  to  emesis)  are  very  similar, 
as  a  group,  to  those  who  become  sick.  Therefore,  in  this  investigation 
the  quitters  have  been  included  as  part  of  the  susceptible  population. 

Mauro  and  Smith  applied  goodness-of-f it  tests  appropriate  for  log¬ 
normal  and  Weibull  samples.  Since  the  data  sets  being  used  here  have 
been  augmented  by  the  addition  of  quitters,  those  tests  have  been  repeated. 
An  additional  test,  directly  comparing  the  lognormal  and  Weibull  distri¬ 
butions,  is  also  used.  The  power  of  the  goodness-of-f it  tests  depends 
on  the  exact  nature  of  the  departures  from  the  specified  distributions. 
Therefore,  it  is  difficult  to  compare  the  results  from  the  two  tests 
directly  since  they  will  have  different  power  against  a  given  alter¬ 
native  distribution.  The  additional  test  given  here  provides  the  link 
between  the  two  distributions  that  is  needed  to  choose  one  over  the  other. 
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(This  third  test  is  based  on  the  assumption  that  the  data  definitely 


come  from  either  a  lognormal  or  a  Weibull  distribution.  It  must  be 
used  in  conjunction  with  the  goodness-of-fit  tests.) 

Table  1  gives  the  results  of  the  three  tests  for  the  12  motion 
conditions  being  considered.  All  tests  have  been  conducted  at  the 
a  =  .05  significance  level.  The  first  two  tests  have  been  documented 
in  [4].  The  test  statistic  for  the  third  test  is  the  nth  root  of  the 
ratio  of  maximized  likelihood  functions  for  the  Weibull  and  lognormal 
distributions,  where  n  is  the  sample  size.  The  critical  values  are 
tabulated  in  [2],  Large  values  of  the  test  statistic  lead  to  the  choice 
of  the  Weibull  distribution  while  small  values  indicate  that  the  log¬ 
normal  distribution  is  preferable.  Moderate  values  indicate  that  no 
choice  is  possible  on  the  basis  of  this  test.  The  last  column  lists 
the  distribution  which,  based  on  all  three  tests,  best  fits  the  observed 
data.  (Some  of  the  results  of  the  lognormal  goodness-of-fit  tests 
given  for  single-frequency  data  differ  from  those  reported  by  Mauro  and 
Smith  [4],  That  report  contains  some  minor  inaccuracies  which  have  been 
corrected  here.  The  overall  conclusions  have  not  been  affected.) 

The  results  in  Table  1  show  clearly  that  the  Weibull  distribution 
is  superior  to  the  lognormal  as  a  model  for  time  to  emesis.  The  log¬ 
normal  goodness-of-fit  test  rejects  at  the  a  =  .05  significance  level 
in  3  of  the  12  cases  considered.  The  Weibull  goodness-of-fit  test  does 
not  reject  for  any  of  the  data  sets.  Furthermore,  the  likelihood  ratio 
test  chooses  the  Weibull  distribution  for  5  of  the  12  data  sets.  In 
the  other  7  cases  there  is  insufficient  information  to  make  a  definite 
choice  between  the  two  distributions. 


■  -■> . 
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Table  1:  Results  of  Goodness  of  Fit  Tests 


Further  insight  into  how  the  two  models  fit  the  data  can  be 
obtained  from  an  examination  of  the  fitted  distribution  functions. 

In  each  case,  two  parameters  must  be  estimated  from  the  data.  (Both 
the  lognormal  and  Weibull  distributions  have  three-parameter  versions. 
However,  the  third  parameter  is  a  threshold  parameter  which  seems  in¬ 
appropriate  in  this  situation.  We  have  found  no  evidence  of  any  lag 
before  subjects  feel  the  effects  of  the  various  motion  conditions.  The 
threshold  parameter  has,  therefore,  been  set  to  zero.)  The  lognormal 
probability  density  function  is  given  by: 

f  (t)  =  ~fa \  t  exp  ' 

The  Weibull  probability  density  function  may  be  written  as: 


-(In  t  -  y)' 

2~P 


t  >  0,  o  >  0. 


g(t) 


c/b(t/b)C  1 


t  >  0,  b  >  0,  c  >  0. 


Both  of  these  distributions  have  been  fit  in  each  of  the  12  cases  under 
consideration.  (For  details  of  the  estimation  procedure,  see  [2].)  The 
estimated  parameters  are  given  in  Table  2.  From  these  parameters,  the 
fitted  distribution  functions  have  been  calculated  for  each  of  the  12  mo¬ 
tion  conditions.  Figures  1  through  12  contain  plots  of  these  functions 
along  with  the  empirical  distribution  functions. 

In  most  of  the  plots,  the  fitted  functions  both  seem  to  follow  the 
empirical  distribution  fairly  well.  However,  in  those  cases  where  the 
Weibull  was  chosen  previously  (see  Table  1),  the  plots  confirm  the  fact 
that  the  lognormal  distribution  does  not  fit  the  data.  The  lognormal 
tends,  in  those  cases,  to  underpredict  motion  sickness  incidence  later  in 
time  while  overpredicting  at  earlier  time  points. 
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Table  2:  Estimated  Parameters  for  Weibull  and  Lognormal  Distributions 


III. _ COMPARISON  OF  MOTION  CONDITIONS  IN  THE  CORRELATION  STUDY 


Nearly  all  of  the  existing  data  pertaining  to  motion  sickness  in¬ 
cidence  have  been  obtained  in  studies  using  single- frequency  sinusoidal 
motion.  The  Correlation  Study  was  designed  as  a  first  step  toward  the 
study  of  more  complex  motions.  Condition  I  of  the  study  consisted  of  a 
single  sine  wave  expected  to  produce  a  high  rate  of  emesis  in  the  subject 
population.  The  other  four  conditions  employed  harmonic  combinations  of 
two  frequencies  selected  to  examine  the  influence  of  relative  amplitude, 
phasing,  and  spectral  separation  on  motion  sickness  incidence.  Table  3 
summarizes  some  of  the  motion  parameters  which  define  the  five  exper¬ 
imental  conditions  of  the  study. 

The  first  question  to  be  considered  is  whether  there  are  any 
significant  differences  in  motion  sickness  incidence  between  the  five 
groups  in  the  Correlation  Study.  Guignard  and  McCauley  [3]  tested  the 
hypothesis  that  the  total  proportions  of  subjects  becoming  sick  in  each 
group  were  equal.  When  the  test  was  performed  without  the  inclusion  of 
quitters  in  the  samples,  it  was  not  significant  at  the  a  =  .05  level. 

(It  was  significant  at  the  a  =  . 10  level.)  With  the  inclusion  of  quitters, 
the  p-value  of  the  test  was  approximately  .02.  (The  p-value  is  the 
lowest  significance  level  at  which  a  test  will  reject  the  null  hypothesis.) 

The  test  given  in  [3]  treated  the  observations  as  dichotomous.  It 
considers  only  whether  or  not  a  subject  became  sick,  disregarding  the 
observed  time  until  emesis.  It  seems  reasonable  that  if  two  motion  con¬ 
ditions  induce  emesis  in  the  same  total  proportion  of  subjects,  but 
subjects  becoming  sick  in  the  first  condition  do  so  at  earlier  times 


I 

ii 

Condition 

III 

IV 

V 

Fundamental: 

Frequency  (Hz) 

0.17 

0.16 

0.16 

0.17 

0.16 

rms  Acceleration 

(g) 

0.14 

0.14 

0.14 

0.12 

0.08 

Harmonic: 

Frequency  (Hz) 

— 

0.32 

0.33 

0.50 

0.33 

rms  Acceleration 

(g) 

— 

0.15 

0.14 

0.29 

0.26 

Phase  Angle  (deg) 

0 

+90 

0 

0 

Total  rms  Acceleration 

(g) 

0.14 

0.20 

0.19 

0.31 

0.27 

p  Table  3:  Delivered  Motion  Parameters  of  Correlation  Study  Waveforms 


than  those  in  the  second  condition,  then  the  first  condition  should  be 
considered  more  severe.  The  procedure  discussed  above  cannot  discriminate 
between  such  conditions  since  it  considers  only  the  total  proportions  of 
subjects  who  became  sick  within  two  hours.  In  this  section  we  use  a  test 
procedure  which  can  detect  more  general  differences  between  the  five 
conditions. 

In  the  previous  section,  we  showed  that  time  to  emesis  can  be 
explained  by  a  mixture  model.  The  susceptible  portion  of  the  population, 
which  varies  with  the  type  of  motion,  is  fit  well  by  the  Weibull  distri¬ 
bution.  The  rest  of  the  population  follows  some  distribution  which  can¬ 
not  be  determined  from  the  available  data.  Since  the  complete  mixture 
distribution  is  therefore  unknown,  we  treat  the  data  as  a  set  of  censored 
samples  from  various  unknown  distributions.  This  lack  of  knowledge  neces¬ 
sitates  the  use  of  a  nonparametric  test,  a  number  of  which  have  been  pro¬ 
posed  for  similar  problems. 

Of  the  available  nonparametric  tests  which  are  applicable  to  cen¬ 
sored  data,  some  are  more  sensitive  to  particular  differences  between 
distributions  than  are  others.  We  have  chosen  to  use  a  test  developed 
by  Breslow  [1],  which  is  particularly  sensitive  to  differences  in  the 
hazard  rates  which  occur  early  in  time,  (The  hazard  rate  is  defined  as 
f(t)/[l-F(t) ] ,  where  f(t)  is  the  probability  density  function  and  F(t) 
is  the  cumulative  distribution  function.)  This  is  a  generalized  version 
of  the  two-sample  Wilcoxon  test,  which  may  be  found  in  any  text  on  non¬ 
parametric  statistics.  It  can  be  used  with  any  number  of  samples  and 
is  applicable  to  censored  data.  The  actual  test  procedure  is  fairly 
detailed  and  is  not  given  here.  A  complete  discussion  may  be  found  in 
the  indicated  reference. 
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Under  the  null  hypothesis  that  k  populations  are  equal,  Breslow's 
test  statistic  has  an  asymptotic  \2  distribution  with  k-1  degrees  of 
freedom.  When  the  quitters  are  dropped  from  the  samples,  the  test  sta¬ 
tistic  is: 


X*  =  10.269;  p  =  .0361  . 

This  is  a  very  conservative  result  since  the  deletion  of  quitters  from 
the  samples  tends  to  equalize  the  five  distributions.  The  conditions 
with  the  highest  rate  of  emesis  also  have  the  most  quitters.  On  the 
other  hand,  while  it  seems  certain  that  most  of  the  quitters  quit  because 
they  were  becoming  nauseated,  the  actual  times  of  emesis  might  have  been 
much  later  if  they  had  continued  in  the  experiment.  Therefore,  inclusion 
of  the  quitters  tends  to  overemphasize  the  differences  between  the  distri¬ 
butions,  giving  a  liberal  significance  level.  The  test  statistic  cal¬ 
culated  with  quitters  included  is: 

X2  =  16.060;  p  =  .0029  • 

2 

The  test  statistics  given  above  provide  substantial  evidence  in¬ 
dicating  that  the  five  conditions  of  the  Correlation  Study  yield  different 
rates  of  emesis.  Therefore,  further  investigation  of  those  differences 
is  warranted.  Since  a  question  of  primary  interest  is  which  conditions 
are  most  severe,  we  performed  all  possible  pairwise  comparisons  of  the 
distributions.  The  p-values  for  those  tests,  with  and  without  quitters, 
are  given  in  Table  4.  Condition  V  had  the  highest  observed  motion  sick¬ 
ness  incidence  and  from  the  table  can  be  seen  to  be  significantly  more 
severe  than  the  mildest  motion  conditions.  Even  without  the  inclusion  of 
quitters,  there  is  a  significant  difference  between  Conditions  II  and  V. 
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(a)  p-Values  of  Pairwise  Tests  Without  Quitters 


I 

II 

III 

IV 

II 

.4549 

III 

.4460 

.0972 

IV 

.4851 

.0599 

.7791 

V 

.0707 

.0011 

.3091 

.1145 

(b)  p- 

-Values  of  Pairwise  Tests 

With  Quitters 

I 

II 

III  IV 

II 

.4805 

III 

.4666 

.1152 

IV 

.1820 

.0122 

.5609 

V 

.0172 

.0001 

.0626  .1197 

Table  4:  Pairwise  Tests  for  the  Equality  of 

Distributions  in  the  Correlation  Study 


With  quitters  included,  the  comparison  of  Conditions  I  and  V  is  also 
significant,  as  is  the  comparison  of  Conditions  II  and  IV. 


Since  the  sample  sizes  used  in  the  Correlation  Study  are  relatively 
small,  it  is  difficult  to  detect  subtle  differences  between  the  five 
motion  conditions.  Nonparametric  tests,  in  particular,  are  not  very 
powerful  when  used  with  small  samples.  However,  the  Correlation  Study 
was  intended  to  be  mostly  exploratory  in  nature.  As  such,  it  has  helped 
to  point  out  some  of  the  effects  of  complex  waveform  motions  and  indicate 
possible  avenues  for  future  research. 
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IV.  SUMMARY 


In  this  investigation  we  have  examined  12  sets  of  motion  sickness 
incidence  data.  Seven  of  those  data  sets  were  taken  from  various  single¬ 
frequency  studies  while  the  rest  of  the  data  comprised  the  five  conditions 
of  the  Correlation  Study,  four  of  which  were  dual-frequency  motion  con¬ 
ditions.  Each  of  the  experiments  limited  exposure  to  two  hours  duration 
and  each  used  frank  emesis  as  the  experimental  endpoint. 

In  section  II  of  this  report  we  have  shown  that  time  to  emesis 
data  may  be  described  by  a  statistical  mixture  model.  The  first  pop¬ 
ulation  in  the  mixture  is  made  up  of  the  subjects  who  become  sick  with¬ 
in  2  hours  and  follows  a  Weibull  probability  distribution.  We  have  in¬ 
cluded  the  quitters  in  that  population.  The  second  part  of  the  mixture 
accounts  for  those  who  do  not  become  sick  within  two  hours.  The  distribution 
of  that  part  of  the  mixture  cannot  be  determined  from  the  available  data. 

Both  lognormal  and  Weibull  distribution  functions  have  been  fit  to 
the  uncensored  portions  of  the  samples.  These  functions  have  been  plotted 
along  with  the  empirical  distribution  functions.  The  plots  illustrate 
the  differences  in  the  way  the  two  functions  fit  the  data  and  give  further 
evidence  for  the  choice  of  the  Weibull  over  the  lognormal  distribution 
as  a  model  for  time  to  emesis. 

In  section  III  we  have  used  a  nonparametric  test,  developed  by 
Breslow  [1],  to  compare  the  five  conditions  of  the  Correlation  Study. 

The  comparisons  have  been  made  both  with  and  without  the  inclusion  of 
quitters  in  the  samples.  In  either  case,  the  overall  test  that  the 
five  conditions  are  equal  rejects  at  the  a  »  .05  level  of  significance. 


The  evidence  for  rejection  of  this  hypothesis  is  much  stronger  when 
quitters  are  included.  Pairwise  comparisons  of  the  conditions  have 
also  been  made,  showing  that  Condition  V  is  significantly  more  severe 
than  the  mildest  conditions.  For  the  most  part,  however,  we  cannot  dis¬ 
tinguish  between  the  conditions  on  the  basis  of  the  test  used  here.  This 
may  be  a  result  of  the  fact  that  the  test  is  not  very  powerful  for  the 
sample  sizes  being  used. 
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V.  APPENDIX:  DATA  LISTING  AND  FIGURES 


The  data  given  here  were  taken  from  a  number  of  experiments  per¬ 
formed  using  the  ONR  motion  generator.  Our  source  for  the  first  seven 
data  sets  was  [5].  The  data  for  the  Correlation  Study  is  given  in  [6], 
with  the  exception  that  the  times  for  the  quitters  were  not  reported 
there.  We  obtained  those  times  from  the  authors.  The  times  are  given 
in  minutes  and  those  for  quitters  are  marked  with  an  asterisk.  In  the 
identification  of  the  data  sets,  f  denotes  frequency  (Hz) ,  a  denotes 
rms  acceleration  (g) ,  and  N  is  the  total  number  of  subjects  in  the 
experiment. 
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Figure  1:  Comparison  of  fitted  lognormal  and  Weibull  cumulative  distribution 
functions  to  empirical  distribution  function:  f=.167,  a=.lll  . 


Figure  2:  Comparison  of  fitted  lognormal  and  Weibull  cumulative  distribution 
functions  to  empirical  distribution  function:  f=.167,  a=.222  . 
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Figure  3:  Comparison  of  fitted  lognormal  and  Weibull  cumulative  distribution 
functions  to  empirical  distribution  function:  f=.250,  a=.lll  . 
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Figure  5:  Comparison  of  fitted  lognormal  and  Weibull  cumulative  distribution 
functions  to  empirical  distribution  function:  f=.333,  a=.222  . 


lognormal 


Figure  7:  Comparison  of  fitted  logpormal  and  Weibull  cumulative  distribution 
functions  to  empirical  distribution  function:  f=.500,  a=.444  . 


Figure  8:  Comparison  of  fitted  lognormal  and  Weibull  cumulative  distribution 

functions  to  empirical  distribution  function:  Condition  I  of 
Correlation  Study. 


Figure  9:  Comparison  of  fitted  lognormal  and  Weibull  cumulative  distribution 
functions  to  empirical  distribution  function:  Condition  II  of 
Correlation  Study. 


Figure  10:  Comparison  of  fitted  lognormal  and  Weibuli  cumulative  distribution 
functions  to  empirical  distribution  function:  Condition  III  of 
Correlation  Study. 


Figure  11:  Comparison  of  fitted  lognormal  and  Weibull  cumulative  distribution 
functions  to  empirical  distribution  function:  Condition  IV  of 
Correlation  Study. 
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Figure  12:  Comparison  of  fitted  lognormal  and  Weibull  cumulative  distribution 
functions  to  empirical  distribution  function:  Condition  V  of 
Correlation  Study. 
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